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Abstract 

This paper shows how DATR, a widely used 
formal language for lexical knowledge rep- 
resentation, can be used to define an LTAG 
lexicon as an inheritance hierarchy with in- 
ternal lexical rules. A bottom- up featural 
encoding is used for LTAG trees and this al- 
lows lexical rules to be implemented as co- 
variation constraints within feature struc- 
tures. Such an approach eliminates the 
considerable redundancy otherwise associ- 
ated with an LTAG lexicon. 

1 Introduction 

The Tree Adjoining Grammar (T AG) formalism wa s 
first introduced two decades ago (Joshi ct al., 1975), 
and since then there has been a steady stream of 
theoretical work using the formalism. But it is 
only more recently that grammars of non-trivial 
size have be en de veloped: Abeille, Bishop, Cote 
& Schabes ( 1990 ) describe a feature-based Lexi- 
calized Tree Adjoining Grammar (LTAG) for En- 
glish which subsequently became the basis for the 
grammar used in the XTAG system, a wide-coverage 
LTAG p arser (Doran ct al., 1994b; |Doran et al 



19944 |XTAG Research Group, 199jf T The ad- 
vent of such large grammars gives rise to questions 
of efficient representation, and the fully lexicalized 
character of the LTAG formalism suggests that re- 
cent research into lexical representation might be a 
pla ce to look for answers (see for exam ple Briscoe et 
aZ.(1993); Daelemans & Gazdar(1992)). In this pa- 
per we explore this suggestion by showing how the 
lexical knowledge representation language (lkrl) 
DATR (|Evans & Gazdar, 1989a|; |Evans & Gazdar 



1989b) can be used to formulate a compact, hierar- 
chical encoding of an LTAG. 

The issue of efficient representation for LTAG0 is 

x As with all fully lexicalized grammar formalisms, 
there is really no conceptual distinction to be drawn in 
LTAG between the lexicon and the grammar: the gram- 
matical rules are just lexical properties. 



discussed by Vijay-Shanker & Schabes ( |1992| ), who 
draw attention to the considerable redundancy in- 
herent in LTAG lexicons that are expressed in a flat 
manner with no sharing of structure or properties 
across the elementary trees. For example, XTAG cur- 
rently includes over 100,000 lexemes, each of which 
is associated with a family of trees (typically around 
20) drawn from a set of over 500 elementary trees. 
Many of these trees have structure in common, many 
of the lexemes have the same tree families, and many 
of the trees within families are systematically related 
in ways which other formalisms capture using trans- 
formations or metarules. However, the LTAG formal- 
ism itself does not provide any direct support for 
capturing such regularities. 

Vijay-Shanker & Schabes address this problem 
by introducing a hierarchical lexicon structure with 
monotonic inheritance and lexical rules, using an ap - 
proach loosely based on that of Flickinger (1987) 
but tailored for LTAG trees rathe r than HPSG sub- 
categorization lists. Becker ( [1993 ; 1994) proposes a 
slightly different solution, combining an inheritance 
component and a set of metarules^]. We share their 
perception of the problem and agree that adopting 
a hierarchical approach provides the best available 
solution to it. However, rather than creating a hi- 
erarchical lexical formalism that is specific to the 
LTAG problem, we have used DATR, an LKRL that is 
already quite widely known and used. From an LTAG 
perspective, it makes sense to use an already avail- 
able lkrl that was specifically designed to address 
these kinds of representational issues. From a DATR 
perspective, LTAG presents interesting problems aris- 
ing from its radically lexicalist character: all gram- 
matical relations, including unbounded dependency 
constructions, are represented lexically and are thus 
open to lexical generalization. 

There are also several further benefits to be gained 
from using an established general purpose lkrl such 
as DATR. First, it makes it easier to compare the 
resulting LTAG lexicon with those associated with 



2 See Section ^ for further discussion of these 
approaches. 
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other types of lexical syntax: there are existing DATR 
lexicon fragments for HPSG, PATR and Word Gram- 
mar, among others. Second, DATR is not restricted 
to syntactic description, so one can take advantage 
of existing analyses of other levels of lexical descrip- 
tion, such as phonology, prosody, morphology, com- 
positional semantics and lexical semantics^]. Third, 
one can exploit existing formal and implementation 
work on the language^. 

2 Representing ltag trees 



NPj 




P o NP| 
Figure 1: An example LTAG tree for give 

The principal unit of (syntactic) information as- 
sociated with an LTAG entry is a tree structure in 
which the tree nodes are labeled with syntactic cat- 
egories and feature information and there is at least 
one leaf node labeled with a lexical category (such 
lexical leaf nodes are known as anchors). For exam- 
ple, the canonical tree for a ditransitive verb such as 
give is shown in figure |l|. Following LTAG conventions 
(for the time being), the node labels here are gross 
syntactic category specifications to which additional 
featural information may be added0, and are anno- 
tated to indicate node type: o indicates an anchor 



See, fo r exa mple. Bleiching ( 1992 l 1994 ) , Brown 



199C 



Hipp isley ( |1994| ), Corbett fc Fr aser( |l993^ 7Cahill 
1993) , CamTT& Evan s ( |L990| ), Fr aser fc Corbett in 



press! ), Gibbo n ( |L992[ ), Kilgarriff ( |l993|) Kilgarriff 
Gazdar ( |l995| ), Reinhard & Gibbon (Si). 



4 See, for examp le. An drv et al. (1992) on compila- 
tion, Kilbur y et al. (1991) on coding DAGs, Duda fc Geb- 



hardi ( 1994 ) on dynamic q ueryi ng, Langer ( 199 
vers e que rying, and Barg (h994 [). Lig ht (1994 J 
al. (1993) and Kilbury et a, 



ight et 



(119941) on automatic ac- 
quisition. And there are at least a dozen different DATR 
implementations available, on various platforms and pro- 
gramming languages. 

5 In fact, LTAG commonly distinguishes two sets of 
features at each node (top and bottom), but for simplic- 
ity we shall assume just one set in this paper. 



node, and j. indicates a substitution node (where a 
fully specified tree with a compatible root label may 
be attached)^. 

In representing such a tree in DATR, we do two 
things. First, in keeping with the radically lexical- 
ist character of LTAG, we describe the tree structure 
from its (lexi cal) a nchor upwards^, using a variant 
of Kilbury's (1990) bottom-up encoding of trees. In 
this encoding, a tree is described relative to a partic- 
ular distinguished leaf node (here the anchor node) , 
using binary relations parent, left and right, re- 
lating the node to the subtrees associated with its 
parent, and immediate-left and -right sisters, en- 
coded in the same way. Second, we embed the result- 
ing tree structure (i.e., the node relations and type 
information) in the feature structure, so that the tree 
relations (left, right and parent) become features. 
The obvious analogy here is the use of f irst/rest 
features to encode subcategorisation lists in frame- 
works like HPSG. 

Thus the syntactic feature information directly as- 
sociated with the entry for give relates to the label 
for the v node (for example, the value of its cat fea- 
ture is v, the value of type is anchor), while speci- 
fications of subfeatures of parent relate to the label 
of the VP node. A simple bottom-up DATR represen- 
tation for the whole tree (apart from the node type 
information) follows: 

Give : 

<cat> = v 
<parent cat> = vp 
<parent left cat> = np 
<parent parent cat> = s 
<right cat> = np 
<right right cat> = p 
<right right parent cat> = pp 
<right right right cat> = np. 

This says that Give is a verb, with VP as its par- 
ent, an S as its grandparent and an NP to the left 
of its parent. It also has an np to its right, and a 
tree rooted in a P to the right of that, with a pp 
parent and NP right sister. The implied bottom-up 
tree structure is shown graphically in figure ^[ Here 
the nodes are laid out just as in figure [l], but re- 
lated via parent, left and right links, rather than 
the more usual (implicitly ordered) daughter links. 
Notice in particular that the right link from the 
object noun-phrase node points to the preposition 
node, not its phrasal parent - this whole subtree is 
itself encoded bottom-up. Nevertheless, the full tree 



6 LTAG's other tree-building operation is adjunction, 
which allows a tree-fragment to be spliced into the body 
of a tree. However, we only need to concern ourselves 
here with the representation of the trees involved, not 
with the substitution/adjunction distinction. 

7 The tree in figure hi has more than one anchor - in 
such cases it is generally easy to decide which anchor is 
the most appropriate root for the tree (here, the verb 
anchor). 
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structure is completely and accurately represented 
by this encoding. 



s 




P np 
right 

Figure 2: Bottom-up encoding for Give 

Once we adopt this representational strategy, 
writing an LTAG lexicon in DATR becomes similar 
to writing any other type of lexicalist grammar's 
lexicon in an inheritance-based lkrl. In HPSG, for 
example, the subcategorisation frames are coded as 
lists of categories, whilst in LTAG they are coded as 
trees. But, in both cases, the problem is one of con- 
cisely describing feature structures associated with 
lexical entries and relationships between lexical en- 
tries. The same kinds of generalization arise and the 
same techniques are applicable. Of course, the pres- 
ence of complete trees and the fully lexicalizcd ap- 
proach provide scope for capturing generalizations 
lexically that are not available to approaches that 
only identify parent and sibling nodes, say, in the 
lexical entries. 

3 Encoding lexical entries 

Following conventional models of lexicon organisa- 
tion, we would expect Give to have a minimal syn- 
tactic specification itself, since syntactically it is a 
completely regular ditransitive verb. In fact none 
of the information introduced so far is specific to 
Give. So rather than providing a completely ex- 
plicit DATR definition for Give, as we did above, a 
more plausible account uses an inheritance hierar- 
chy defining abstract intransitive, transitive and di- 
transitive verbs to support Give (among others), as 
shown in figure [| 

This basic organisational structure can be ex- 
pressed as the following DATR fragment^]: 

8 To gain the intuitive sense of this fragment, read 
a line such as <> == VERB as "inherit everything from 
the definition of VERB", and a line such as <parent> == 
PPTREE : <> as "inherit the parent subtree from the defi- 



VERB 




Die VERB+NP 




Eat VERB+NP+PP VERB+NP+NP 



Give Spare 

Figure 3: The principal lexical hierarchy 



VERB: 

<> == TREENQDE 
<cat> == v 
<type> == anchor 
<parent> == VPTREE:<>. 

VERB+NP : 

<> == VERB 

<right> == NPC0MP:<>. 

VERB+NP+PP : 

<> == VERB+NP 

<right right> == PTREE:<> 

<right right root> == to . 

VERB+NP+NP : 

<> == VERB+NP 

<right right> == NPC0MP:<>. 

Die: 

<> == VERB 
<root> == die. 

Eat : 

<> == VERB+NP 
<root> == eat . 

Give : 

<> == VERB+NP+PP 
<root> == give. 

Spare : 

<> == VERB+NP+NP 
<root> == spare. 

Ignoring for the moment the references to 
TREENQDE , VPTREE , NPCOMP and PTREE (which we 
shall define shortly), we see that VERB defines basic 
features for all verb entries (and can be used directly 



nition of PPTREE". Inheritance in DATR is always by de- 
fault - locally defined feature specifications take priority 
over inherited ones. 
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for intransitives such as Die), VERB+NP inherits from 
VERB but adds an np complement to the right of 
the verb (for transitives), VERB+NP+PP inherits from 
VERB+NP but adds a further PP complement and so 
on. Entries for regular verb lexemes are then min- 
imal - syntactically they just inherit everything 
from the abstract definitions. 

This DATR fragment is incomplete, because it 
neglects to define the internal structure of the 
TREENODE and the various subtree nodes in the lexi- 
cal hierarchy. Each such node is a description of an 
LTAG tree at some degree of abstraction]]. The fol- 
lowing DATR statements complete the fragment, by 
providing definitions for this internal structure: 

TREENODE: 

<> == undef 

<type> == internal . 

STREE: 

<> == TREENODE 
<cat> == s. 

VP TREE : 

<> == TREENODE 
<cat> == vp 
<parent> == STREE :<> 
<left> == NPC0MP:<>. 

NPCOMP : 

<> == TREENODE 
<cat> == np 

<type> == substitution. 

PPTREE : 

<> == TREENODE 
<cat> == pp. 

PTREE: 

<> == TREENODE 
<cat> == p 
<type> == anchor 
<parent> == PPTREE :<> 

Here, TREENODE represents an abstract node in an 
LTAG tree and provides a (default) type of internal. 
Notice that VERB is itself a TREENODE (but with the 
nondefault type anchor), and the other definitions 
here define the remaining tree nodes that arise in our 
small lexicon: VPTREE is the node for VERB's par- 
ent, STREE for VERB's grandparent, NPCOMP defines 
the structure needed for NP complement substitu- 
tion nodes, etc.Q 

Taken together, these definitions provide a speci- 
fication for Give just as we had it before, but with 

9 Even the lexeme nodes are abstract - individual 
word forms might be represented by further more specific 
nodes attached below the lexemes in the hierarchy. 

10 Our example makes much use of multiple inheri- 
tance (thus, for example, VPTREE inherits from TREENODE, 
STREE and NPCOMP) but all such multiple inheritance is 
orthogonal in DATR: no path can inherit from more than 
one node. 



the addition of type and root features. They also 
support some other verbs too, and it should be clear 
that the basic technique extends readily to a wide 
range of other verbs and other parts of speech. Also, 
although the trees we have described are all initial 
trees (in LTAG terminology), we can describe aux- 
iliary trees, which include a leaf node of type foot 
just as easily. A simple example is provided by the 
following definition for auxiliary verbs: 

AUXVERB : 

<> == TREENODE 
<cat> == v 
<type> == anchor 
<parent cat> == vp 
<right cat> == vp 
<right type> == foot . 

4 Lexical rules 

Having established a basic structure for our LTAG 
lexicon, we now turn our attention towards captur- 
ing other kinds of relationship among trees. We 
noted above that lexical entries are actually asso- 
ciated with tree families, and that these group to- 
gether trees that are related to each other. Thus 
in the same family as a standard ditransitive verb, 
we might find the full passive, the agentless passive, 
the dative alternation, the various relative clauses, 
and so forth. It is clear that these families corre- 
spond closely to the outputs of transformations or 
metarules in other frameworks, but the XTAG sys- 
tem currently has no formal component for describ- 
ing the relationships among families nor mechanisms 
for generating them. And so far we have said noth- 
ing about them either - we have only characterized 
single trees. 

However, LTAG's large domain of locality means 
that all such relationships can be viewed as directly 
lexical, and thus expressible by lexical rules. In fact 
we can go further than this: because we have em- 
bedded the domain of these lexical rules, namely the 
LTAG tree structures, within the feature structures, 
we can view such lexical rules as covariation con- 
straints within feature structures, in much the same 
way that the covariation of, say, syntactic and mor- 
phological form is treated. In particular, we can 
use the mechanisms that DATR already provides for 
feature covariation, rather than having to invoke in 
addition some special purpose lexical rule machin- 
ery. 

We consider six construction types found in the 
XTAG grammar: passive, dative, subject-auxiliary 
inversion, w/i-questions, relative clauses and topical- 
isation. Our basic approach to each of these is the 
same. Lexical rules are specified by defining a de- 
rived output tree structure in terms of an input tree 
structure, where each of these structures is a set of 
feature specifications of the sort defined above. Each 
lexical rule has a name, and the input and output 
tree structures for rule foo are referenced by pre- 
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fixing feature paths of the sort given above with 
•cinput foo . .> or <output foo . .>. So for ex- 
ample, the category of the parent tree node of the 
output of the passive rule might be referenced as 
<output passive parent cat>. We define a very 
general default, stating that the output is the same 
as the input, so that lexical relationships need only 
concern themselves with components they modify. 
This approach to formulating lexical rules in DATR 
is quite general and in no way restricted to LTAG: it 
can be readily adapted for application in the context 
of any feature-based lexicalist grammar formalism. 

Using this approach, the dative lexical rule can be 
given a minimalist implementation by the addition 
of the following single line to VERB+NP+PP, defined 
above. 

VERB+NP+PP : 

•Coutput dative right right> == NPC0MP:<>. 

This causes the second complement to a ditran- 
sitive verb in the dative alternation to be an np, 
rather than a PP as in the unmodified case. Subject- 
auxiliary inversion can be achieved similarly by just 
specifying the output tree structure without refer- 
ence to the input structure (note the addition here 
of a form feature specifying verb form): 

AUXVERB : 

<output auxinv form> == finite-inv 

<output auxinv parent cat> == s 

■Coutput auxinv right cat> == s . 

Passive is slightly more complex, in that it has to 
modify the given input tree structure rather than 
simply overwriting part of it. The definitions for pas- 
sive occur at the VERB+NP node, since by default, any 
transitive or subclass of transitive has a passive form. 
Individual transitive verbs, or whole subclasses, can 
override this default, leaving their passive tree struc- 
ture undefined if required. For agentless passives, 
the necessary additions to the VERB+NP node are as 
follows^]: 

VERB+NP : 

■Coutput passive form> == passive 

■Coutput passive right> == 

"<input passive right right>". 

Here, the first line stipulates the form of the verb 
in the output tree to be passive, while the second line 
redefines the complement structure: the output of 
passive has as its first complement the second com- 
plement of its input, thereby discarding the first 
complement of its input. Since complements are 
daisy-chained, all the others move up too. 

WTi-questions, relative clauses and topicalisation 
are slightly different, in that the application of the 
lexical rule causes structure to be added to the top 



of the tree (above the S node) . Although these con- 
structions involve unbounded dependencies, the un- 
boundedness is taken care of by the LTAG adjunction 
mechanism: for lexical purposes the dependency is 
local. Since the relevant lexical rules can apply to 
sentences that contain any kind of verb, they need 
to be stated at the VERB node. Thus, for exam- 
ple, topicalisation and w/i-questions can be defined 
as follows: 

VERB: 

•Coutput topic parent parent parent cat> 

== s 

■Coutput topic parent parent left cat> == np 
■Coutput topic parent parent left form> 

== normal 

■Coutput whq> == "<output topic>" 

■Coutput whq parent parent left form> == wh. 

Here an additional np and s are attached above 
the original S node to create a topicalised structure. 
The wh-iule inherits from the topicalisation rule, 
changing just one thing: the form of the new NP 
is marked as wh, rather than as normal. In the full 
fragment^, the np added by these rules is also syn- 
tactically cross-referenced to a specific np marked 
as null in the input tree. However, space does not 
permit presentation or discussion of the DATR code 
that achieves this here. 

5 Applying lexical rules 

As explained above, each lexical rule is defined to 
operate on its own notion of an input and produce 
its own output. In order for the rules to have an ef- 
fect, the various input and output paths have to be 
linked together using inheritance, creating a chain of 
inheritances between the base, that is. the canonical 
definitions we introduced in section 3|, and surface 
tree structures of the lexical entry. For example, to 
'apply' the dative rule to our Give definition, we 
could construct a definition such as this: 

Give-dat : 

<> == Give 

■Cinput dative> == <> 
<surface> == <output dative>. 

Values for paths prefixed with surface inherit 
from the output of the dative rule. The input of 
the dative rule inherits from the base (unprefixed) 
case, which inherits from Give. The dative rule def- 
inition (just the one line introduced above, plus the 
default that output inherits from input) thus me- 
diates between Give and the surface of Give-dat. 
This chain can be extended by inserting additional 
inheritance specifications (such as passive). Note 
that surface defaults to the base case, so all en- 
tries have a surface defined. 



Oversimplifying slightly, the double quotes in 
"■cinput passive right right>" mean that that DATR 
path will not be evaluated locally (i.e., at the VERB+NP 
node), but rather at the relevant lexeme node (e.g., Eat 
or Give). 



12 The full version of this DATR fragment includes all 
the components discussed above in a single coherent, but 
slightly more complex account. It is available on request 
from the authors. 
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However, in our full fragment, additional support 
is provided to achieve and constrain this rule chain- 
ing. Word definitions include boolean features in- 
dicating which rules to apply, and the presence of 
these features trigger inheritance between appro- 
priate input and output paths and the base and 
surface specifications at the ends of the chain. For 
example, Wordl is an alternative way of specifying 
the dative alternant of Give, but results in inheri- 
tance linking equivalent to that found in Give-dat 
above: 

Wordl : 

<> == Give 

<alt dative> == true. 

More interestingly, Word2 properly describes a wh- 
question based on the agentless passive of the dative 
of Give. 

Word2 : 

<> == Give 

<alt whq> == true 

<alt dative> == true 

<alt passive> == true. 

<parent left form> == null 

Notice here the final line of Word2 which specifies 
the location of the 'extracted' np (the subject, in this 
case), by marking it as null. As noted above, the full 
version of the whq lexical rule uses this to specify a 
cross-reference relationship between the wh-NP and 
the null np. 

We can, if we wish, encode constraints on the ap- 
plicability of rules in the mapping from boolean flags 
to actual inheritance specifications. Thus, for exam- 
ple, whq, rel, and topic are mutually exclusive. 
If such constraints are violated, then no value for 
surface gets defined. Thus Word3 improperly at- 
tempts topicalisation in addition to w/i-question for- 
mation, and, as a result, will fail to define a surface 
tree structure at all: 

Word3: 

<> == Give 

<alt whq> == true 

<alt topic> == true 

<alt dative> == true 

<alt passive> == true 

<parent left form> == null. 

This approach to lexical rules allows them to be 
specified at the appropriate point in the lexical hi- 
erarchy, but overridden or modified in subclasses or 
lexemes as appropriate. It also allows default gener- 
alisation over the lexical rules themselves, and con- 
trol over their application. The last section showed 
how the whq lexical rule could be built by a single mi- 
nor addition to that for topicalisation. However, it is 
worth noting that, in common with other DATR spec- 
ifications, the lexical rules presented here are rule 
instances which can only be applied once to any 
given lexeme - multiple application could be sup- 
ported, by making multiple instances inherit from 
some common rule specification, but in our current 
treatment such instances would require different rule 
names. 



6 Comparison with related work 



As noted above, Vijay-Shankcr & Schabes (1992) 
have also proposed an inheritance-based approach 
to this problem. They use monotonic inheritance to 
build up partial descriptions of trees: each descrip- 
tion is a finite set of dominance, immediate domi- 
nance and linear precedence statements about tree 
nodes in a tree descripti on la nguage developed by 
Rogers & Vijay-Shanker ( 1992| ), and category infor- 
mation is located in the node labels. 

This differs from our approach in a number of 
ways. First, our use of nonmonotonic inheritance 
allows us to manipulate total instead of partial de- 
scriptions of trees. The abstract verb class in the 
Vijay-Shanker & Schabes account subsumes both in- 
transitive and transitive verb classes but is not iden- 
tical to either - a minimal-satisfying-model step is 
required to map partial tree descriptions into ac- 
tual trees. In our analysis, VERB is the intransitive 
verb class, with complements specifically marked as 
undefined: thus VERB : <right> == undef is inher- 
ited from TREENODE and VERB+NP just overrides this 
complement specification to add an np complement. 
Second, we describe trees using only local tree re- 
lations (between adjacent nodes in the tree), while 
Vijay-Shanker & Schabes also use a nonlocal domi- 
nance relation. 

Both these properties are crucial to our embed- 
ding of the tree structure in the feature structure. 
We want the category information at each tree node 
to be partial in the conventional sense, so that in 
actual use such categories can be extended (by uni- 
fication or whatever) . So the feature structures that 
we associate with lexical entries must be viewed as 
partial. But we do not want the tree structure to 
be extendible in the same way: we do not want an 
intransitive verb to be applicable in a transitive con- 
text, by unifying in a complement np. So the tree 
structures we define must be total descriptions^. 
And of course, our use of only local relations al- 
lows a direct mapping from tree structure to feature 
path, which would not be possible at all if nonlocal 
relations were present. 

So while these differences may seem small, they al- 
low us to take this significant representational step 
- significant because it is the tree structure embed- 
ding that allows us to view lexical rules as feature co- 
variation constraints. The result is that while Vijay- 
Shanker & Schabes use a tree description language, a 
category description language and a further formal- 
ism for lexical rules, we can capture everything in 
one framework all of whose components (nonmono- 
tonicity, covariation constraint handling, etc.) have 



13 Note that simplified fragment presented here does 
not get this right. It makes all feature specifications total 
descriptions. To correct this we would need to change 
TREENODE so that only the values of <right>, <lef t> and 
<parent> default to undef. 
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already been independently motivated for other as- 
pects of lexical description^ 

Becker's recent work (1993; |1994 ) is also directed 
at exactly the problem we address in the present 
paper. Like him, we have employed an inheritance 
hierarchy. And, like him, we have employed a set of 
lexical rules (corresponding to his metarules). The 
key differences between our account and his are (i) 
that we have been able to use an existing lexical 
knowledge representation language, rather than de- 
signing a formal system that is specific to LTAG, and 
(ii) that we have expressed our lexical rules in ex- 
actly the same language as that we have used to 
define the hierarchy, rather than invoking two quite 
different formal systems. 

Becker's sharp distinction between his metarules 
and his hierarchy gives rise to some problems that 
our approach avoids. Firstly, he notes that his 
metarules are subject to lexical exceptions and pro- 
poses to deal with these by stating "for each entry 
in the (syntactic) lexicon . . which metarules are ap- 
plicable for this entry" (f993,126). We have no need 
to carry over this use of (meta)rule features since, in 
our account, lexical rules are not distinct from any 
other kind of property in the inheritance hierarchy. 
They can be stated at the most inclusive relevant 
node and can then be overridden at the exceptional 
descendant nodes. Nothing specific needs to be said 
about the nonexceptional nodes. 

Secondly, his metarules may themselves be more 
or less similar to each other and he suggests 
(1994, ff) that these similarities could be captured 
if the metarules were also to be organized in a hier- 
archy. However, our approach allows us to deal with 
any such similarities in the main lexical hierarchy 
itsehfi rather than by setting up a separate hierar- 
chical component just for metarules (which appears 
to be what Becker has in mind). 

Thirdly, as he himself notes (f993,128), because 
his metarules map from elementary trees that are in 
the inheritance hierarchy to elementary trees that 
are outside it, most of the elementary trees actually 
used are not directly connected to the hierarchy (al- 
though their derived status with respect to it can be 
reconstructed). Our approach keeps all elementary 
trees, whether or not they have been partly defined 
by a lexical rule, entirely within the lexical hierarchy. 

In fact, Becker himself considers the possibility of 
capturing all the significant generalizations by us- 
ing just one of the two mechanisms that he pro- 
poses: "one might want to reconsider the usage of 
one mechanism for phenomena in both dimensions" 
(1993,135). But, as he goes on to point out, his ex- 
isting type of inheritance network is not up to taking 



14 As in the work cited in footnote 3, above. 

15 As illustrated by the way in which the whq lexical 
rule inherits from that for topicalisation in the example 
given above. 



on the task performed by his metarules because the 
former is monotonic whilst his metarules are not. 
However, he does suggest a way in which the hierar- 
chy could be completely replaced by metarules but 
argues against adopting it (1993,136). 

As will be apparent from the earlier sections of 
this paper, we believe that Becker's insights about 
the organization of an LTAG lexicon can be better 
expressed if the metarule component is replaced by 
an encoding of (largely equivalent) lexical rules that 
are an integral part of a nonmonotonic inheritance 
hierarchy that stands as a description of all the ele- 
mentary trees. 
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